L Number  Hits  Search Text          DB                                  Time stamp
1         3     penner-reinhold.in.  USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:25
2         2     fleig-andrea.in.     USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:25
3         10    ltrpc2               USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:26
4         1     wo adj "200029571"   USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:51
5         1     wo adj "200040614"   USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:27
6         2     6548272.pn.          USPAT; US-PGPUB; EPO; JPO; DERWENT  2004/06/29 15:51

Search History 6/29/04 4:24:58 PM Page 1 
C:\APPS\EAST\Workspaces\10007706.wsp 



10007706 Results: 



SEQ ID NO: 1 



SUMMARIES 



                  %
Result          Query
   No.   Score  Match  Length  DB  ID        Description
     1    7950  100.0    1503   3  AAY92944  Aay92944 Human TRP
     2    7950  100.0    1503   3  AAY95439  Aay95439 Human cal
     3    7950  100.0    1503   3  AAB36865  Aab36865 Human put
     4    7950  100.0    1503   5  ABB76459  Abb76459 Human Ion
     5    7950  100.0    1503   5  ABB84544  Abb84544 Human tra
     6    7950  100.0    1503   7  ADC47022  Adc47022 Human LTR
     7    7950  100.0    1503   7  ADC77685  Adc77685 Human 222
     8    7950  100.0    1503   7  ADC83633  Adc83633 LTRPC3-re
     9    7741   97.4    1469   6  ABR43185  Abr43185 Human REM
    10  6740.5   84.8    1508   7  ADC47024  Adc47024 Rat LTRPC
    11    6709   84.4    1507   7  ADC47034  Adc47034 Mouse LTR
    12    2771   34.9     525   6  ADA56830  Ada56830 Human sec
    13    2771   34.9     525   6  ABR47695  Abr47695 Human sec
    14    2771   34.9     525   6  ABR00046  Abr00046 Human gen
    15    2771   34.9     525   7  ADB91501  Adb91501 Human sec
    16    2771   34.9     525   7  ADC74074  Adc74074 Human sec
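
The "% Query Match" column appears to be each hit's score expressed as a percentage of the score the query obtains against itself (7950 for this 1503-residue query). That relationship is an assumption inferred from the rows above, not something the listing states. A minimal Python sketch under that assumption, with a hypothetical helper name `query_match`:

```python
# Assumption: "Query Match" = hit score / query self-match score, as a percentage.
# SELF_SCORE is taken from the 100.0% rows of the summary table.
SELF_SCORE = 7950.0


def query_match(score: float, self_score: float = SELF_SCORE) -> float:
    """Return the hit score as a percentage of the self-match score (one decimal)."""
    return round(100.0 * score / self_score, 1)


# Scores copied from results 9 to 12 of the summary table.
scores = {9: 7741, 10: 6740.5, 11: 6709, 12: 2771}
for result_no, score in scores.items():
    print(result_no, query_match(score))
```

The values computed this way can be compared against the "% Query Match" column for the same result numbers.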



RESULT 1
AAY92944
ID AAY92944 standard; protein; 1503 AA.
XX
AC AAY92944;
XX
DT 08-NOV-2000 (first entry)
XX
DE Human TRPC7 protein.
XX
KW Transmembrane protein; TRPC7; brain; transient receptor potential; TRP;
KW calcium channel function; human; gene therapy; periodic psychosis;
KW mutation.
XX
OS Homo sapiens.
XX
PN WO200029571-A1.
XX
PD 25-MAY-2000.
XX
PF 11-NOV-1999; 99WO-JP006289.
XX
PR 12-NOV-1998; 98JP-00321200.
XX
PA (EIKE ) EIKEN KAGAKU KK.
XX
PI Shimizu N, Nagamine K;
XX
DR WPI; 2000-387784/33.
DR N-PSDB; AAA11284.
XX
PT Nucleic acids encoding transmembrane protein TRPC7 expressed in brain and
PT homologous to transient receptor potential protein useful in the
PT treatment of associated diseases such as periodic psychosis.
XX
PS Claim 1; Page 64-71; 77pp; Japanese.
XX
CC The invention relates to the isolation of a nucleic acid encoding a
CC transmembrane protein TRPC7 which is expressed in brain and is homologous
CC to transient receptor potential (TRP) protein. This suggests that the
CC TRPC7 protein may have a calcium channel function. This sequence
CC represents the human TRPC7 protein. The DNA and protein can be used in
CC the diagnosis and treatment of disorders associated with TRPC7,
CC especially the screening, monitoring and treatment (by gene therapy) of
CC periodic psychosis, which appears to be associated with mutations in the
CC TRPC7 gene.
XX
SQ Sequence 1503 AA;

Query Match 100.0%; Score 7950; DB 3; Length 1503;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1503; Conservative 0; Mismatches 0; Indels 0; Gaps 0;


Qy    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60

Qy   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120

Qy  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180

Qy  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240

Qy  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300

Qy  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360

Qy  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420

Qy  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480

Qy  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540

Qy  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600

Qy  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660

Qy  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720

Qy  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780

Qy  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840

Qy  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900

Qy  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960

Qy  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020

Qy 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080

Qy 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140

Qy 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200

Qy 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260

Qy 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320

Qy 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380

Qy 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440

Qy 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500

Qy 1501 AHY 1503
        |||
Db 1501 AHY 1503





RESULT 2
AAY95439
ID AAY95439 standard; protein; 1503 AA.
XX
AC AAY95439;
XX
DT 10-OCT-2000 (first entry)
XX
DE Human calcium channel polypeptide.
XX
KW Human; SOC-2/CRAC-1; calcium channel; store operated channel;
KW calcium release activated channel; therapy; diagnosis;
KW lymphocyte proliferative disorder.
XX
OS Homo sapiens.
XX
PN WO200040614-A2.
XX
PD 13-JUL-2000.
XX
PF 20-DEC-1999; 99WO-US029996.
XX
PR 30-DEC-1998; 98US-0114220P.
PR 29-JAN-1999; 99US-0120018P.
PR 22-JUN-1999; 99US-0140415P.
XX
PA (BETH-) BETH ISRAEL DEACONESS MEDICAL CENT.
XX
PI Scharenberg AM;
XX
DR WPI; 2000-465957/40.
DR N-PSDB; AAA49926.
XX
PT New SOC/CRAC calcium channel polynucleotides and polypeptides used to
PT diagnose and treat proliferative disorders associated with the channel,
PT and to screen for novel modulators of the channel.
XX
PS Example; Page 69-72; 108pp; English.
XX
CC The present sequence is that of a human calcium channel polypeptide as
CC deduced from a cDNA clone (see AAA49926) identified in an EST database
CC search for sequences showing homology to Caenorhabditis elegans calcium
CC signal sequences. Clones isolated from such screenings were used to
CC identify human clones (see AAA49922-24) encoding members (see AAY95435-
CC 37) of a new family of SOC (store operated channel) or CRAC (calcium
CC release activated channel) calcium channel polypeptides. SOC/CRAC
CC polypeptides modulate Ca2+ flux into and out of a cell, and may be
CC activated upon depletion of Ca2+ from intracellular calcium stores,
CC allowing Ca2+ influx into a cell. SOC/CRAC polypeptides and
CC polynucleotides can be used to treat patients that require modulation of
CC calcium influx into their SOC/CRAC-expressing cells. Polypeptides can be
CC used to deliver therapeutic and/or imaging agents to such cells to
CC modulate proliferation and growth. SOC/CRAC polypeptides also represent
CC targets for designing and/or identifying inhibitors that block lymphocyte
CC proliferation and binding agents that selectively bind to SOC/CRAC
CC polypeptides to which drugs or toxins can be conjugated for delivery to
CC SOC/CRAC expressing cells.
XX
SQ Sequence 1503 AA;

Query Match 100.0%; Score 7950; DB 3; Length 1503;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1503; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60



RESULT 3
AAB36865
ID AAB36865 standard; protein; 1503 AA.
XX
AC AAB36865;
XX
DT 21-FEB-2001 (first entry)
XX
DE Human putative Ca2+ channel protein TrpC7 protein.
XX
KW mutTCCH-1; sugar-phosphate hydrolase; NUDT9.
XX
OS Homo sapiens.
XX
PN WO200065056-A2.
XX
PD 02-NOV-2000.
XX
PF 26-APR-2000; 2000WO-US011319.
XX
PR 26-APR-1999; 99US-0131051P.
XX
PA (BETH-) BETH ISRAEL DEACONESS MEDICAL CENT.
XX
PI Scharenberg AM;
XX
DR WPI; 2000-687347/67.
DR N-PSDB; AAC68399.
XX
PT Novel nucleic acids encoding a MutT domain-containing polypeptide present
PT in human calcium channel protein TrpC7, useful for diagnostic and
PT therapeutic purposes.
XX
PS Claim 39; Page 65; 69pp; English.
XX
CC The present invention relates to human mutTCCH-1. Therapeutics involving
CC mutTCCH-1 may be useful for diagnosing and treating conditions associated
CC with aberrant levels of expression of (II) and for identifying agents
CC that are useful for treating diseases associated with
CC pyrophosphohydrolase and/or sugar-phosphate hydrolase activity.
XX
SQ Sequence 1503 AA;

Query Match 100.0%; Score 7950; DB 3; Length 1503;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1503; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60

Qy   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120

Qy  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180

Qy  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240

Qy  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300

Qy  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360

Qy  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420

Qy  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480

Qy  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540

Qy  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600

Qy  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660

Qy  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720

Qy  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780

Qy  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840

Qy  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900

Qy  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960

Qy  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020

Qy 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080

Qy 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140

Qy 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200

Qy 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260

Qy 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320

Qy 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380

Qy 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440

Qy 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500

Qy 1501 AHY 1503
        |||
Db 1501 AHY 1503



SUMMARIES 

                  %
Result          Query
   No.   Score  Match  Length  DB  ID                  Description
     1    7950  100.0    1503   4  US-09-600-087-2     Sequence 2, Appli
     2    2216   27.9    1095   4  US-09-636-215-780   Sequence 780, App
     3    2216   27.9    1095   4  US-09-685-166A-780  Sequence 780, App
     4    2212   27.8    1095   3  US-09-112-096-15    Sequence 15, Appl
     5    2212   27.8    1095   4  US-09-636-215-778   Sequence 778, App
     6    2212   27.8    1095   4  US-09-685-166A-778  Sequence 778, App
     7    1423   17.9    1533   1  US-08-623-679-9     Sequence 9, Appli
     8    1423   17.9    1533   3  US-08-933-774-9     Sequence 9, Appli
     9    1423   17.9    1533   3  US-09-181-030-9     Sequence 9, Appli
    10    1423   17.9    1533   4  US-09-534-242-9     Sequence 9, Appli
    11    1423   17.9    1533   4  US-09-454-854-9     Sequence 9, Appli
    12    1423   17.9    1533   4  US-09-164-671-9     Sequence 9, Appli
    13    1341   16.9    1497   1  US-08-623-679-7     Sequence 7, Appli
    14    1341   16.9    1497   3  US-08-933-774-7     Sequence 7, Appli
    15    1341   16.9    1497   3  US-09-181-030-7     Sequence 7, Appli



RESULT 1
US-09-600-087-2
; Sequence 2, Application US/09600087
; Patent No. 6548272
; GENERAL INFORMATION:
; APPLICANT: Shimizu, Nobuyoshi
; APPLICANT: Nagamine, Kentaro
; TITLE OF INVENTION: GENE CODING FOR A NOVEL TRANSMEMBRANE PROTEIN
; FILE REFERENCE: 11283-004001
; CURRENT APPLICATION NUMBER: US/09/600,087
; CURRENT FILING DATE: 2000-07-11
; PRIOR APPLICATION NUMBER: PCT/JP99/06289
; PRIOR FILING DATE: 1999-11-11
; PRIOR APPLICATION NUMBER: JP/321200/1998
; PRIOR FILING DATE: 1998-11-12
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 2
; LENGTH: 1503
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: TRANSMEM
; LOCATION: (320)...(344)
; NAME/KEY: TRANSMEM
; LOCATION: (750)...(773)
; NAME/KEY: TRANSMEM
; LOCATION: (794)...(818)
; NAME/KEY: TRANSMEM
; LOCATION: (867)...(891)
; NAME/KEY: TRANSMEM
; LOCATION: (900)...(924)
; NAME/KEY: TRANSMEM
; LOCATION: (932)...(956)
; NAME/KEY: TRANSMEM
; LOCATION: (1024)...(1048)
US-09-600-087-2

Query Match 100.0%; Score 7950; DB 4; Length 1503;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1503; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSWRLQCPFGNNDKQESLS 60

Qy   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 SWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQWDPKKHVQ 120

Qy  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 EMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNLLISVTGGAKNF 180

Qy  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 NMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELITIG 240

Qy  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 VATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTCLDSNHSHFILVDDGTHGQYGVEIP 300

Qy  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVVVEGSGRVA 360

Qy  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420

Qy  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480

Qy  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540

Qy  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQ 600

Qy  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660

Qy  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720

Qy  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 ALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQD 780

Qy  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 840

Qy  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900

Qy  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960

Qy  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020

Qy 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080

Qy 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 1140

Qy 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200

Qy 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260

Qy 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320

Qy 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWKLP 1380

Qy 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440

Qy 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQKAAAEFG 1500

Qy 1501 AHY 1503
        |||
Db 1501 AHY 1503



RESULT 2
US-09-636-215-780
Sequence 780, Application US/09636215
Patent No. 6620922
GENERAL INFORMATION:
APPLICANT: Xu, Jiangchun
APPLICANT: Dillon, Davin C.
APPLICANT: Mitcham, Jennifer L.
APPLICANT: Harlocker, Susan L.
APPLICANT: Jiang, Yuqui
APPLICANT: Henderson, Robert A.
APPLICANT: Kalos, Michael D.
APPLICANT: Fanger, Gary R.
APPLICANT: Retter, Marc W.
APPLICANT: Stolk, John A.
APPLICANT: Day, Craig H.
APPLICANT: Vedvick, Thomas S.
APPLICANT: Carter, Darrick
APPLICANT: Li, Samuel
APPLICANT: Wang, Aijun
APPLICANT: Skeiky, Yasir A.W.
APPLICANT: Hepler, William
TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY AND
TITLE OF INVENTION: DIAGNOSIS OF PROSTATE CANCER
FILE REFERENCE: 210121.42717C17
CURRENT APPLICATION NUMBER: US/09/636,215
CURRENT FILING DATE: 2000-08-10
NUMBER OF SEQ ID NOS: 852
SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 780
LENGTH: 1095
TYPE: PRT
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: VARIANT
LOCATION: (1)...(1095)
OTHER INFORMATION: Xaa = Any Amino Acid
US-09-636-215-780

Query Match 27.9%; Score 2216; DB 4; Length 1095;
Best Local Similarity 41.2%; Pred. No. 4.6e-210;
Matches 468; Conservative 208; Mismatches 353; Indels 108; Gaps 22;
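
The "Best Local Similarity" figure appears to be the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches, and indels). This reading is an assumption inferred from the statistics above, and the function name `best_local_similarity` is hypothetical. A minimal Python sketch:

```python
# Assumption: Best Local Similarity = Matches / (Matches + Conservative
# + Mismatches + Indels), expressed as a percentage.
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Return identical positions as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts taken from the statistics line of this result.
print(best_local_similarity(468, 208, 353, 108))
# Counts from a full-length identical hit (1503 matches, nothing else).
print(best_local_similarity(1503, 0, 0, 0))
```

The printed values can be checked against the "Best Local Similarity" lines reported for the corresponding results.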

Qy 56 QESLSSWIPENIKKKECVYFVESSKLSDAGKWCQCGYTHEQHLEEATKPHTFQGTQ 112 

= 1-11 Ihllhl = li | : IhlM Ihl Ml 
Db 29 ESDLVNFIQANFKKRECVFFTKDSK ATENVCKCGYAQSQHME GTQINQ 76

Qy 113 WDPKKHVQEMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQHWGLDVPNL 169

h III =1 llllllll I I :| Ihhl II : =:| hllll I III 
Db 77 SEKWNYKKHTKEFPTDAFGDIQFETLGKK-GKYIRLSCDTDAEILYELLTQHWHLKTPNL 135

Qy 170 LISVTGGAKNFNMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSS 229 

hlllllllll =111 = = II I h hh lllhllhl hll hi III ::| 
Db 136 VISVTGGAKNFALKPRMRKIFSR- LIYIAQSKGAWILTGGTHYGLMKYIGEWRDNTISR 194 

Qy 230 SYKEGELITIGVATWGTVHRREGLIH PTGSFPAEYILDEDGQGNLTCLDSNHSHFIL 286

I :| =: Ihl Nihil II hh:h = I Ihlhl =| 

Db 195 SSEE-NIVAIGIAAWGMVSNRDTLIRNCDAEGYFLAQYLMDDFTRDPLYILDNNHTHLLL 253 

Qy 287 VDDGTHGQYGVEIPLRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTN 346



I hi II II II :|||:||hi : I II II I II II h = I 

Db 254 VDNGCHGHPTVEAKLRNQLEKYISERTIQDSNYGGKIPIVCFAQGGGKETLKAINTSIKN 313 

Qy 34 7 GTPCVWEGSGRVADVIAQVANLPVSD- ITISLIQQKLSVFFQEMFETFTESRIVEWTKK 4 05 

IIIMIIIhUIIII : = I i =1 I = = = I I I III 
Db 314 KIPCWVEGSGQIADVIASL- - VEVEDALTSSAVKEKLVRFLPRTVSRLPEEETESWIKW 3 71 

Qy 406 IQDIVRRRQLLTVFREGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRV 465 

■'■■'•\-- WW = = I : I II II II = = =11= Nil : lh = 

Db 372 LKE ILECSHLLTVI KMEEAGDEI VSNAI SYALYKAFSTSEQ - DKDNV/NGQLKLLLEWNQL 430 

Qy 466 DIARSEIFMDEWQWKPSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYEN 52 5 

1 = 1 III - =1= =11 I III = = hlhlllllh h = hl M h I 
Db 431 DLANDEIFTNDRRWESADLQEVMFTALIKDRPKFVRLFLENGLNLRKFLTHDVLTELFSN 4 90 

Qy 526 LDPSCLFHSKLQKVLVEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLYPRPRHNDR 5 85 

11=11 I : : :|: :| : | | | 

Db 4 91 -HFSTLVYRNLQ IAKNSYNDALLTFVWKLVANFRRGFRKEDR-NGR 5 34 

Qy 586 LRLLLPVPHVKLNVQGVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQS 645 

= = : II I I = = I llll = ll=:|h =11 h 

Db 535 DEMDIELHDVS PITRHPLQALFIWAILQNKKELSKVIWEQT 575 

Qy 64 6 QDCIAAALACSKILKELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLL 705 

= I Ml 11=11 hi = I : = = I II III 11= =11111 11= h = ll 
Db 5 76 RGCTL7AALGASKLLKTLAKVKNDINAAGESEELANEYETRAVELFTECYSSDEDLAEQLL 635 

Qy 706 TRVSEAWGKTTCLQLALEAKDMKFVSHGGIQAFLTKVWWGQLSVDNGLWRVTLCMLAFPL 7 65 

Mil = Ihlhll I h= hi Ihl hh = l I h= |h || 
Db 63 6 VYSCEAWGGSNCLELAVEATDQHFIAQPGVQNFLSKQWYGEISRDTKNWKI ILCLFI I PL 6 95 

Qy 76 6 LLTGLISFREKRLQDVGTP7AARAR AFFTAPVWFHLNILSYFAFLCLFAYVLM 818 

I =111 = 1 I = = 1111 = 1 III l = = I Ml I ! 1 1 1 h 

Db 6 96 VGCGFVSFRKK PVDKHKKLLWYYVAFFTSPFWFSWNWFYIAFLLLFAYVLL 74 8 

Qy 819 VDFQPVPSWCECAIYLWLFSLVCEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILL 8 78 

= 11 II I =1 :| I hhll = = Ihl M =| = 

Db 74 9 MDFHSVPHPPELVLYSLVFVLFCDEVRQWYVN GVNYFTDLWNVMDTLGLFY 79 9 

Qy 87 9 FVAGLTCRL 1 PATLYPGRVI LSLDFI LFCLRLMH I FT I S KTLG PKI 1 1 VKRMMKDVF 93 5 

hlh II ::|| M I I Ihhl llhlllhh lllllh-lh III 

Db 800 FIAGIVFRLHSSNKSSLYSGRVIFCLDYITFTLRLIHIFTVSRNLGPKI IMLQRMLIDVF 859 

Qy 93 6 FFLFLLAVWWSFGVAKQAILIHNERRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEH 9 95 

HIM I h II': I I Ihl hll :| II =111 = 1 =11 == I 
Db 86 0 FFLFLFAXWMVAFGVARQGILRQNEQRWRWIFRSVIYEPYLAMFGQVPSDVDGTTYDFAH 919 

Qy 996 CSPNGTDPYKPKCPESDATQQRPAFPEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQV 1055 

II I 1111 = 1= 1 = 1 = 1 = 1 llllhllhlll II I 

Db 920 CTFTGNES-KPLCVELD-EHNLPRFPEWITIPLVCIYMLSTNILLVNLLVAMFGYTVGTV 977 

Qy 1056 QEHTDQIWKFQRHDLIEEYHGRPAAPPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEK 1115 

11= 11=11111= h = ll I I llh == = =1= = = = = I 

Db 97 8 QENNDQVWKFQRYFLVQEYCSRLNIPFPFIVFAYFYMWKKCFKCCCKEKNMESSVCCFK 103 7 

Qy 1116 NEEAALLSWEIYLKENYLQNRQFQQKQRPEQK IEDISNKV 1155 

Ih hll :||IM = h = = =1 = 11 = 

Db 103 8 NEDNETLAWEGVMKENYLVKINTKANDTSEEMRHRFRQLDTKLNDLKGLLKEIANKI 1094 
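
The Best Local Similarity reported for the US-09-636-215-780 hit can be reproduced from its column counts. The sketch below assumes (this is inferred from the figures, not stated by the report) that the value is the number of identical matches divided by the total number of alignment columns, i.e. matches + conservative + mismatches + indels. The function name is illustrative only.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts reported for the US-09-636-215-780 hit against the query.
pct = best_local_similarity(matches=468, conservative=208,
                            mismatches=353, indels=108)
print(f"{pct:.1f}%")  # 41.2%
```

Under the same assumption, a hit with 1503 matches and no other columns gives 100.0%, which agrees with the figure reported for the self-hit.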



SUMMARIES

                  %
Result          Query
  No.   Score   Match  Length  DB  ID          Description

   1    7950    100.0   1503    1  TRL2_HUMAN  O94759 homo sapien
   2    1282.5   16.1   2032    1  TRPG_CAEEL  Q93971 caenorhabdi
   3     659      8.3   1017    1  TRL3_HUMAN  Q9hcf6 homo sapien
   4     450      5.7   1418    1  CE11_CAEEL  P34641 caenorhabdi
   5     436.5    5.5    350    1  NUD9_HUMAN  Q9bw91 homo sapien
   6     377.5    4.7    260    1  YQ08_CAEEL  Q09297 caenorhabdi
   7     356      4.5   1027    1  TRPL_CAEEL  P34586 caenorhabdi
   8     344      4.3   1124    1  TRPL_DROME  P48994 drosophila
   9     341.5    4.3    975    1  TRP5_MOUSE  Q9qx29 mus musculu
  10     340      4.3    973    1  TRP5_HUMAN  Q9ul62 homo sapien
  11     334.5    4.2    974    1  TRP5_RABIT  O62852 oryctolagus
  12     331      4.2    885    1  TRP2_RAT    Q9r283 rattus norv
  13     331      4.2    977    1  TRP4_HUMAN  Q9ubn4 homo sapien
  14     324.5    4.1    981    1  TRP4_BOVIN  P79100 bos taurus
  15     321.5    4.0   1275    1  TRP_DROME   P19334 drosophila
  16     321      4.0    977    1  TRP4_RAT    O35119 rattus norv
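
The "% Query Match" column appears to be each hit's score expressed as a percentage of the top (self) score of 7950. That relationship is an inference from the listed values rather than something the report states. A minimal Python sketch that parses summary rows and checks it for the first three hits; the helper name and the dictionary keys are hypothetical:

```python
def parse_summary_row(line):
    """Split one summary row into typed fields; the description keeps its spaces."""
    no, score, match, length, db, ident, desc = line.split(maxsplit=6)
    return {
        "no": int(no),
        "score": float(score),
        "match": float(match),
        "length": int(length),
        "db": int(db),
        "id": ident,
        "description": desc,
    }


rows = [parse_summary_row(r) for r in (
    "1 7950 100.0 1503 1 TRL2_HUMAN O94759 homo sapien",
    "2 1282.5 16.1 2032 1 TRPG_CAEEL Q93971 caenorhabdi",
    "3 659 8.3 1017 1 TRL3_HUMAN Q9hcf6 homo sapien",
)]

self_score = rows[0]["score"]  # assumed to be the query's score against itself
for r in rows:
    recomputed = round(100.0 * r["score"] / self_score, 1)
    print(r["id"], r["match"], recomputed)
```

For these rows the recomputed percentage equals the listed one, which supports the assumed definition without proving it.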



RESULT 1
TRL2_HUMAN
ID TRL2_HUMAN STANDARD; PRT; 1503 AA.
AC O94759; Q96KN6;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 15-MAR-2004 (Rel. 43, Last annotation update)
DE Long transient receptor potential channel 2 (LTrpC2) (Transient
DE receptor potential channel 7) (TrpC7).
GN TRPM2 OR LTRPC2 OR TRPC7 OR KNP3.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A. (ISOFORM 1).
RC TISSUE=Brain;
RX MEDLINE=99026133; PubMed=9806837;
RA Nagamine K., Kudoh J., Minoshima S., Kawasaki K., Asakawa S., Ito F.,
RA Shimizu N.;
RT "Molecular cloning of a novel putative Ca2+ channel protein (TRPC7)
RT highly expressed in brain.";
RL Genomics 54:124-131(1998).
RN [2]
RP SEQUENCE FROM N.A. (ISOFORM 2).
RX MEDLINE=22075135; PubMed=11960981;
RA Wehage E., Eisfeld J., Heiner I., Jungling E., Zitt C., Luckhoff A.;
RT "Activation of the cation channel long transient receptor potential
RT channel 2 (LTRPC2) by hydrogen peroxide. A splice variant reveals a
RT mode of activation independent of ADP-ribose.";
RL J. Biol. Chem. 277:23150-23156(2002).
RN [3]
RP SEQUENCE FROM N.A.
RX MEDLINE=20289799; PubMed=10830953;
RA Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T.,
RA Park H.-S., Toyoda A., Ishii K., Totoki Y., Choi D.-K., Groner Y.,
RA Soeda E., Ohki M., Takagi T., Sakaki Y., Taudien S., Blechschmidt K.,
RA Polley A., Menzel U., Delabar J., Kumpf K., Lehmann R., Patterson D.,
RA Reichwald K., Rump A., Schillhabel M., Schudy A., Zimmermann W.,
RA Rosenthal A., Kudoh J., Shibuya K., Kawasaki K., Asakawa S.,
RA Shintani A., Sasaki T., Nagamine K., Mitsuyama S., Antonarakis S.E.,
RA Minoshima S., Shimizu N., Nordsiek G., Hornischer K., Brandt P.,
RA Scharfe M., Schoen O., Desario A., Reichelt J., Kauer G., Bloecker H.,
RA Ramser J., Beck A., Klages S., Hennig S., Riesselmann L., Dagand E.,
RA Wehrmeyer S., Borzym K., Gardiner K., Nizetic D., Francis F.,
RA Lehrach H., Reinhardt R., Yaspo M.-L.;
RT "The DNA sequence of human chromosome 21.";
RL Nature 405:311-319(2000).
CC -!- FUNCTION: May be a calcium channel.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Probable).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Comment=Additional isoforms seem to exist;
CC Name=1;
CC IsoId=O94759-1; Sequence=Displayed;
CC Name=2;
CC IsoId=O94759-2; Sequence=VSP_006574, VSP_006575;
CC -!- TISSUE SPECIFICITY: Highly expressed in brain.



CC -!- SIMILARITY: Belongs to the transient receptor family. LTrpC
CC subfamily.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AB001535; BAA34700.1;
DR EMBL; AJ417076; CAD01139.1;
DR EMBL; AP001754; BAA95563.1;
DR Genew; HGNC:12339; TRPM2.
DR MIM; 603749; -.
DR GO; GO:0005887; C:integral to plasma membrane; TAS.
DR GO; GO:0005262; F:calcium channel activity; TAS.
DR GO; GO:0006816; P:calcium ion transport; TAS.
DR InterPro; IPR002111; Cat_channel_TrpL.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR002153; Trans_receptor.
DR Pfam; PF00520; ion_trans; 1
DR PRINTS; PR01097; TRNSRECEPTRP.
KW Ionic channel; Transmembrane; Ion transport; Calcium channel;
KW Alternative splicing.
FT DOMAIN       1    752  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   753    773  POTENTIAL.
FT DOMAIN     774    795  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   796    816  POTENTIAL.
FT DOMAIN     817    820  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   821    841  POTENTIAL.
FT DOMAIN     842    896  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM   897    917  POTENTIAL.
FT DOMAIN     918    936  CYTOPLASMIC (POTENTIAL).
FT TRANSMEM   937    957  POTENTIAL.
FT DOMAIN     958   1025  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  1026   1046  POTENTIAL.
FT DOMAIN    1047   1503  CYTOPLASMIC (POTENTIAL).
FT VARSPLIC   538    557  Missing (in isoform 2).
FT                        /FTId=VSP_006574.
FT VARSPLIC  1291   1325  DTLEPLSTIQYNWDGLRDRRSFHGPYTVQAGLPL -> E
FT                        (in isoform 2).
FT                        /FTId=VSP_006575.
FT CONFLICT  1088   1088  S -> N (IN REF. 2).
FT CONFLICT  1189   1189  R -> Q (IN REF. 3).
SQ SEQUENCE 1503 AA; 171225 MW; AD329AE79F1A71B5 CRC64;
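
The DOMAIN and TRANSMEM features of the TRL2_HUMAN entry describe a predicted topology. The Python sketch below copies their coordinates from the entry and checks two things: that the segments tile residues 1 to 1503 without gaps or overlaps, and how many transmembrane segments there are. The tuple layout and function name are illustrative choices, not part of the SWISS-PROT format.

```python
# (key, start, end) copied from the DOMAIN and TRANSMEM lines of the entry.
FEATURES = [
    ("DOMAIN", 1, 752),
    ("TRANSMEM", 753, 773),
    ("DOMAIN", 774, 795),
    ("TRANSMEM", 796, 816),
    ("DOMAIN", 817, 820),
    ("TRANSMEM", 821, 841),
    ("DOMAIN", 842, 896),
    ("TRANSMEM", 897, 917),
    ("DOMAIN", 918, 936),
    ("TRANSMEM", 937, 957),
    ("DOMAIN", 958, 1025),
    ("TRANSMEM", 1026, 1046),
    ("DOMAIN", 1047, 1503),
]


def is_contiguous(features, length):
    """True if the segments cover 1..length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in features:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


tm_segments = [(s, e) for key, s, e in FEATURES if key == "TRANSMEM"]
tm_lengths = {e - s + 1 for s, e in tm_segments}

print(is_contiguous(FEATURES, 1503))  # True
print(len(tm_segments), tm_lengths)   # 6 {21}
```

The annotation therefore lists six predicted membrane-spanning segments of 21 residues each, flanked by cytoplasmic N- and C-terminal regions.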







Query Match 100.0%; Score 7950; DB 1; Length 1503;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1503; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



Ml i ill Nil I UN MM III l!M I IN | 



Ml 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 II 1 1 1 M I i ! I i 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 



1 1 1 M M 1 1 1 J I II 1 1 1 1 M f M 1 1 M M II 1 1 1 1 1 1 II 1 1 1 1 1 



II 



Mill 



MM MM Mil ill I MM Ml 1 



II 



Qy 301 LRTRLEKFISEQTKERGGVAIKIPIVCWLEGGPGTLHTIDNATTNGTPCVWEGSGRVA 360 



Db 301 LRTRLEKFISEQTKERGGVAIKIPIVCWL 360 

QY 361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFETFTESRIVEWTKKIQDIVRRRQLLTVFR 420 



Db 361 DVIAQVANLPVSDITISLIQQKLSVFFQEMFET^ 420 

Qy 421 EGKDGQQDVDVAILQALLKASRSQDHFGHENWDHQLKLAVAWNRVDIARSEIFMDEWQWK 480 



Db 421 EGKDGQQDVDVAILQALLKASRSQDHFGHEIWDHQLK 480 

Qy 481 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVTWDTLLYLYENLDPSCLFHSKLQKVL 540 

Db 4 81 PSDLHPTMTAALISNKPEFVKLFLENGVQLKEFVT 54 0 

Db 54 1 VEDPERPACAPAAPRLQMHHVAQVLRELLGDFTQPLyUp 6 00 

Qy 601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILK 660 

Db 601 GVSLRSLYKRSSGHVTFTMDPIRDLLIWAIVQNRRE^ 660 

Qy 6 61 ELSKEEEDTDSSEEMLALAEEYEHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQL 720 

Db 6 61 elskeeedtdsseemialaeeyehraigvftecyrkidee^ 720 

Qy 721 ALEA^MKFVSHGGIQAFLTKVWW^^ 780 

Db 721 ALEAKDMKFVSHGGIQAFLTKVWWGQL^^ 780 

Qy 7S1 VGTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLV 84 0 

Db 781 V GTPAARARAFFTAPVWFHLNILSYFAFLCLFAYVLMVDFQPVPS^ 840 

Qy 841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 CEEMRQLFYDPDECGLMKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILS 900



Qy 901 LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIH'NE 960 

M I M II I M 1 1 II II 1 1 1 II II II I II 1 1 II 1 1 1 M II II II I II II 1 1 II 1 1 II II 1 1 

LDFILFCLRLMHIFTISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNE 960 



Db 901 



Qy 96 i RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 1020 

I I III I II III 1 1 1" II IN !l 1 1 1 1 1 M I 1 II! lilt 1 1,1 1 III I II M 

RRVDWLFRGAVYHSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAF 102 0 



Db 961 



Qy 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080 

mmmmmmmmmmmimimimmiimmmmmiiiimiimimm 

Djd 1021 PEWLTVLLLCLYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAA 1080 

Qy 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLONROFOO 1140 

in inn f 1 1 1 1 r 1 1 1 1 1 1 1 1 r 1 1 m f 1 1 i 1 1 1 f f 1 11 

Db 1081 PPPFILLSHLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQ 114 0 

Qy 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200 

        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 1141 KQRPEQKIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRA 1200 

Qy 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 SGFSSEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNE 1260

Qy 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 1320 

MIMMMMIMMIIIIIIIIMMMIIIIIMIIMIMIIMIIIIIIIIilli 

Db 1261 KVPWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ 13 20 

Qy 1321 AGLPLN PMGRTGLRGRGSLS CFGPNHTLY PMVTRWRRNEDGAI CRKS I KKMLE VLWKL P 1380 



Db 13 21 AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMWRWRRNEDGAICRKSIKKMLEVLVVKLP 13 8 0 

Qy 13 31 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 144 0 

Db 1381 LSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTDNAWI 1440 

QY 1441 ETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQWDRRIPLYANHKTLLQKAAAEFG 1500 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1441 ETVAVS VHFQDQNDVELNRLNSNLHACDSGAS I RWQ WDRRI PLYANHKTLLQKAAAEFG 15 00 

Qy 1501 AHY 1503 

III 

Db 1501 AHY 1503 



SEQ ID NO: 2 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



   1    4512    100.0   4512    6  AX574359   AX574359 Sequence
   2    4512    100.0   6220    6  BD270147   BD270147 Character
   3    4512    100.0   6220    6  AR306288   AR306288 Sequence
   4    4512    100.0   6220    6  AX042216   AX042216 Sequence
   5    4512    100.0   6220    6  AX574360   AX574360 Sequence
   6    4512    100.0   6220    9  AB001535   AB001535 Homo sapi
   7    4163.2   92.3   5284    9  HSA417076  AJ417076 Homo sapi
   8    3220     71.4   5980   10  MMU344343  AJ344343 Mus muscu
   9    1600.3   35.5   1S38    6  BD107877   BD107877 36 human
  10    1280.8   28.4   1314    6  BD107862   BD107862 36 human
  11     705.8   15.6   1670    9  AB017549   AB017549 Homo sapi
  12     676.2   15.0   3913    6  AX135949   AX135949 Sequence
  13     676.2   15.0   3913    9  AF177473   AF177473 Homo sapi
  14     672.a   14.9   3804    6  AX708830   AX708830 Sequence
  15     667.6   14.8   4184   10  AY072788   AY072788 Rattus no



RESULT 4
AX042216
LOCUS       AX042216 6220 bp DNA linear PAT 23-NOV-2000
DEFINITION  Sequence 5 from Patent WO0065056.
ACCESSION   AX042216
VERSION     AX042216.1 GI:11340930
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Scharenberg,A.M.
  TITLE     Nucleic acids encoding a mutt domain-containing polypeptide
  JOURNAL   Patent: WO 0065056-A 5 02-NOV-2000;
            Beth Israel Deaconess Medical Center, Inc. (US)
FEATURES    Location/Qualifiers
source 1..6220
/organism="Homo sapiens"
/mol_type="unassigned DNA"
/db_xref="taxon:9606"
CDS 446..4957
/note="unnamed protein product"
/codon_start=1
/protein_id="CAC17068.1"
/db_xref="GI:11340931"
/db_xref="REMTREMBL:CAC17068"

/translation="MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSW 
RLQCPFGNNDKQESLSSWIPENIKKKECVYFVESSKLSDAGKVVCQCGYTHEQHLEEA
TKPHTFQGTQWDPKKHVQEMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQ 
HWGLDVPNLLISVTGGAKNFNMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQV
GEAVRDFSLSSSYKEGELITIGVATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTC 
LDSNHSHFILVDDGTHGQYGVEIPLRTRLEKFISEQTKERGGVAIKIPIVCWLEGGP 
GTLHTIDNATTNGTPCWVEGSGRVADVIAQVANLPVSDITISLIQQKLSVFFQEMFE 
TFTESRIVEWTKKIQDIVRRRQLLTVFREGKDGQQDVDVAILQALLKASRSQDHFGHE 
NWDHQLKLAVAWNRVDIARSEIFMDEWQWKPSDLHPTMTAALISNKPEFVKLFLENGV 
QLKEFVTWDTLLYLYENLDPSCLFHSKLQKVLVEDPERPACAPAAPRLQMHHVAQVLR 
ELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQGVSLRSLYKRSSGHVTFTMDPIRD 
LLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILKELSKEEEDTDSSEEMLALAEEY 
EHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQLALEAKDMKFVSHGGIQAFLT 
KVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQDVGTPAARARAFFTAPVW 
FHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLVCEEMRQLFYDPDECGL 
MKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILSLDFILFCLRLMHIF 
TISKTLGPKIIIVKRMMKDVFFFLFLLAVWWSFGVAKQAILIHNERRVDWLFRGAVY 
HSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAFPEWLTVLLLC 
LYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAAPPPFILLS 
HLQLFIKRWLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQKQRPEQ 
KIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRASGFS 
SEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNEKV 

PWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNWDGLRDRRSFHGPYTVQ

AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLWK 
LPLSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTD 
NAWIETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQWDRRIPLYANHKTLLQK 
AAAEFGAHY" 
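
The CDS feature of the AX042216 record spans 446..4957. A short Python sketch of the arithmetic this implies: the span is 4512 nucleotides, a whole number of codons (1504), which is consistent with a 1503-residue product plus a stop codon if the feature includes the stop (an assumption here, not something the record states). The second helper maps a 1-based position in a query that starts at the CDS start onto record coordinates, assuming an ungapped alignment. Both function names are hypothetical.

```python
def cds_length(start, end):
    """Length in nucleotides of a 1-based, inclusive feature location."""
    return end - start + 1


def query_to_record(q_pos, cds_start=446):
    """Map a 1-based query position to the record coordinate.

    Assumes the query begins at the CDS start and aligns without gaps.
    """
    return q_pos + cds_start - 1


n = cds_length(446, 4957)
print(n, n // 3, n % 3)        # 4512 1504 0
print(query_to_record(1))      # 446
print(query_to_record(4512))   # 4957
```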



ORIGIN 



Query Match 100.0%; Score 4512; DB 6; Length 6220;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 4512; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  446 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 505

Qy   61 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  506 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 565

Qy  121 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  566 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 625

Qy  181 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  626 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 685

Qy  241 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  686 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 745

Qy  301 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  746 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 805

Qy  361 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  806 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 865

Qy  421 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  866 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 925



Qy  481 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  926 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 985

Qy  541 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  986 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 1045

Qy  601 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1046 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 1105


Qy 


661 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 72 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 I 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 1 I I I 
1 1 i i i i i 1 1 i i i i i i 1 1 1 i ) i i i i i i i i i i i i i i i 1 1 1 1 [ i 1 1 1 i | 1 1 

110 6 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 1165 


Db 


Qy 


721 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 7 8 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 i l l l l 1 l l l l l l l l l l l f 1 l l | l | | | | | | 1 | ! | 1 | | | | | | | | | | l | | | j | | | 

116 6 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 12 2 5 


Db 


Qy 


781 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 84 0 

1 1 1 1 1 1 1 1 1 1 1 ! 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 1 1 I I 1 II 1 1 1 1 1 1 1 M II I I I i i i 


Db 


' 1 1 1 1 1 1 1 1 i i ' i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i t i i i i i i i 1 1 [ | | | i | I 

122 6 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 12 8 5 


Qy 


841 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 900 
1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 


Db 


i^n^ i i i i m i i i ii i i i i i i i i i i i M I I I I I I I I I j I I I M 1 1 I ! 1 1 M 1 I 1 1 M ! ' 

12 86 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 1345 


Qy 


901 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 96 0 


Db 


IIIIIIIIIIIMIIIIIIIINIIIIIIIIIMIIIIIIIIIIIIMIIIIII! 

134 6 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 1405 


Qy 


961 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 1020 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 


Db 


1 1 1 1 ' 1 1 1 J 1 1 i 1 ' 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 | | | 1 | | | | | | | 1 | | [ | | [ | | | | j | 
14 06 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 14 65 


Qy 


1021 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 10 8 0 

1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 t 1 II 1 1 
i i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 i 1 1 1 1 1 1 | | 1 | | | | | | j | | | | J | | | j | | | | | | | ( 

14 6 6 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 1525 


Db 


Qy 


10 81 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 114 0 
II 1 1 1 i 1 1 1 i ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 f 1 II 1 1 1 II 1 1 1 1 III 1 1 1 1 1 1 1 1 1 


Db 


1 1 1 1 1 1 1 1 1 1 ii i J i i > i i i i i i i l l i l I l 1 1 1 1 1 i 1 1 1 1 1 1 1 I I | | 1 | | i | 1 | | | j [ 1 
152 6 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 1585 


Qy 


1141 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 12 0 0 


Db 


IIIIMIMIIIIIIIIIIIilllllllllllilllilllllMlllillllll 

15 86 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 164 5 


Qy 


12 01 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 1260 


Db 


IIIMIM III II II II II III II II Ml llllll III II II II III II Mil III II II 

1646 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 17 05 


Qy 


12 61 GAAGGCAAGGATGGTCAGCAGGACGTGGATGTGGCCATCTTGCAGGCCTTGCTGAAAGCC 13 2 0 

IIIIIIIMIIIIIIIIIIMMIIIIIIIMMIIIIII 

1706 GAAGGCAAGGATGGTCAGCAGGACGTGGATGTGGCCATCTTGCAGGCCTTGCTGAAAGCC 176 5 


Db 


Qy 


13 21 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 13 8 0 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 ! 1 1 1 1 1 1 
1 1 ' 1 1 ' ' 1 l l l l l 'I ' 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ I 1 1 1 | ( j | | | | | 

1766 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 1825 


Db 


Qy 


13 81 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 14 4 0 

II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 
i ' i i i i J i i i i i i l l l I 1 1 1 1 1 1 1 1 1 II 1 1 1 | I | 1 | | | | | | | | | | | | | | | [ | | [ J | J | | j 

18 2 6 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 1885 


Db 


Qy 


1441 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 15 00 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 


Db 


1 1 1 1 1 1 1 1 1 1 1 1 1 i i i i i i i i i i I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | 1 | 1 j J | 
1886 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 194 5 


Qy 


15 01 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 1560 


Db 


c ! 1" 1 1 1 1 1 M II 1 1 1 1 1 1 1 IN 1 1 1 Mill 1 1 'i llllll 1 II 1 II II 1 1 h I MM i 

1946 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 2005 


Qy 


1561 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 162 0 

1 ! 1 1 1 1 1 1 1 1 1 1 1 I 1 i 1 1 1 i i i i i i i i i i i i « i i i i i ■ i i , . i . , 


Db 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | 

2 0 06 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 2 065 


Qy 


1621 GTGGAGGATCCCGAGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 16 8 0 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

2 066 GTGGAGGATCCCG AGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 2125 


Db 


Qy 


1681 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 174 0 



i U 1 1 ; 

Db 2126 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 218 5 

Qy 1741 CGGCACAACGACCGGCTGCGGCTCCTGCTGCCCGTTCCCCACGTCAAGCTCAACGTGCAG 1800 

MIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIllilMIIIIIIIIIIIII 

Db 218 6 CGGCACAACGACCGGCTGCGGCTCCTGCTGCCCGTTCCCCACGTCAAGCTCAACGTGCAG 224 5 

Qy 18 01 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 18 60 

IIIMIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMHI 

Db 224 6 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 23 05 

Qy 1861 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 192 0 

Ml III! iM! I:N M M M M M M Ml 1 1 1 1 1 II 1 1 1 1 1 1 1 1 ! I In 1 1 (IM 1 1 

Db 2306 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 23 65 

Qy 1921 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 198 0 

IIIMMMIIIillMIIIIIMIIIMIIIMIIMIIIIIIIIIlllMIIIIIIII 

Db 2 36 6 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 24 2 5 

Qy 1981 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 2040 

IIIIMNIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIMMIIIIIIIII 

Db 24 2 6 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 24 85 

Qy 2041 GAGT ATG AGC AC AG AG CC AT CGGGGT CTT C AC CG AG TG C T AC CGG AAG G A CG AAG AG AG A 2100 

IIMIIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIMIMIIIMIIIIIIIIIIM 

Db 2486 GAGTATGAGCACAGAGCCATCGGGGTCTTCACCGAGTGCTACCGGAAGGACGAAGAGAGA 2545 

Qy 2101 GCCCAGAAACTGCTCACCCGCGTGTCCGAGGCCTGGGGGAAGACCACCTGCCTGCAGCTC 2 16 0 

I N I I I I II I I I I I I I I I I I I I I I I I | | | ! | | | | | | | ! | | | | | | | I | | | | | | | | | | | | | | 
Db 2546 GCCCAGAAACTGCTCACCCGCGTGTCCGAGGCCTGGGGGAAGACCACCTGCCTGCAGCTC 2605 

Qy 2161 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC ■ 22 2 0 

Mil' !! ' h'll ll 1 1 , ! 1 1 , I i , I ! || ,'![[ 1 1 1 1 1 1 1 1 1 1| || l| || || || || 

Db 2 506 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC 2665 

Qy 2221 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 22 8 0 

Db 2 666 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 2 72 5 

Qy 2281 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 234 0 

M ! ' ' . |,| 

Db 27 26 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 2 7 85 

Qy 2 341 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 24 0 0 

I III- I I . I : i i 1 I 

Db 2 7 86 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 2 84 5 

Qy 24 01 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 24 6 0 

ii in mi in 1 1 ii 1 1 mi ii n mi mm ii minimi minimi mi 

Db 2846 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 2905 

Qy 2 4 61 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 252 0 

M 1 1 1 III I II M 1 1 II 1 1 M I II 1 1 M 1 1 1 M M II I M I M M M 1 1 1 M 1 1 M I Ml 

Db 2 90 6 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 2 965 

Qy 25 21 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 2 580 

MIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIMIIIIIMIIIIIIIlim 

Db 2 966 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 3 025 

Qy 2581 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 2640 

iimiiMiiiiiiiiiiiiiiiiiiMiMiiiiiiiiiiiiiiii mini 

Db 3 026 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 3085 

Qy 2641 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 2 70 0 

IIIIIMMIIIIMMIMMMMIIMMIIMMIMMMMMMIMMMM 

Db 3 0 86 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 314 5 

Qy 2 701 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 2 76 0 

Mill II II MM MM I II II I II II II Ml I Mill 1 1 1 1 1 1 1 f 1 1 1 f M I III! Ill 

Db 3146 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 3205 



Qy 


2 7 61 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 2 82 0 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIII 

3206 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 3265 


Db 


Qy 


2 8 21 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 2 88 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMMIIIIMIIIIIMIIIII 

3 2 6 6 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 33 2 5 


Db 


Qy 


2 8 81 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 2940 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 f I 1 1 1 1 
l 1 I II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I I I I I ! I | | | | | | | | | | | | | | | | | [ j | [ | | | | j ( j | | 

33 26 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 3385 


Db 


Qy 


2 941 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 3 0 00 

llllllllllllllllllllllllillliilllilllllllllMIIIIMIMI 

3 3 86 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 344 5 


Db 


Qy 


3 0 01 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3060 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 i 1 1 1 1 1 I 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 i 1 1 ! 1 1 1 N II 1 1 i 1 1 


Db 


1 ! 1 1 ' 1 N 1 I 1 II 1 1 M 1 1 1 1 1 1 | 1 | | | M 1 1 1 1 1 1 I II 1 I I I I 1 M 1 M | I J j | | | 1 | | 
344 6 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3505 


Qy 


3 061 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 312 0 

1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II i 1 1 1 1 1 1 1 1 1 1 
' 1 I 1 II 1 1 ) M 1 1 II 1 1 1 1 II 1 ! 1 1 f 1 1 I I I | | | | | [ | | | | M | | | | | | 1 | 1 j | | | j [ 

3506 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 3565 


Db 


Qy 


3121 CTCAACCTCCTCATCGCCATGTTCAACT AC ACCTTCCAGCAGGTGC AGGAGCACACGGAC 3180 


Db 


IIIIIIIIIIMIIIIIIIMIIIIIIMIIIIMMMIIIIIIIIIIIIIIIIIIIII 

3 566 CTCAACCTCCTCATCGCCATGTTCAACTACACCTTCCAGCAGGTGCAGGAGCACACGGAC 3 62 5 


Qy 3181 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3626 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3685

Qy 3241 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3686 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3745

Qy 3301 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3746 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3805

Qy 3361 CTGCTATCCTGGGAGATCTACCTGAAGGAGAACTACCTCCAGAACCGACAGTTCCAGCAA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3806 CTGCTATCCTGGGAGATCTACCTGAAGGAGAACTACCTCCAGAACCGACAGTTCCAGCAA 3865

Qy 3421 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAGGTTGACGCCATGGTGGAC 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3866 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAGGTTGACGCCATGGTGGAC 3925

Qy 3481 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAGCAGAGGTTGGCCTCCCTG 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3926 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAGCAGAGGTTGGCCTCCCTG 3985

Qy 3541 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGGACGCTGCGGGCC 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3986 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGGACGCTGCGGGCC 4045


Qy 3601 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCTCCCAGAAGGCCGCGGAGGAG 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4046 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCTCCCAGAAGGCCGCGGAGGAG 4105

Qy 3661 CCGGATGCTGAGCCGGGAGGCAGGAAGAAGACGGAGGAGCCGGGCGACAGCTACCACGTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4106 CCGGATGCTGAGCCGGGAGGCAGGAAGAAGACGGAGGAGCCGGGCGACAGCTACCACGTG 4165

Qy 3721 AATGCCCGGCACCTCCTCTACCCCAACTGCCCTGTCACGCGCTTCCCCGTGCCCAACGAG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4166 AATGCCCGGCACCTCCTCTACCCCAACTGCCCTGTCACGCGCTTCCCCGTGCCCAACGAG 4225

Qy 3781 AAGGTGCCCTGGGAGACGGAGTTCCTGATCTATGACCCACCCTTTTACACGGCAGAGAGG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4226 AAGGTGCCCTGGGAGACGGAGTTCCTGATCTATGACCCACCCTTTTACACGGCAGAGAGG 4285

Qy 3841 AAGGACGCGGCCGCCATGGACCCCATGGGAGACACCCTGGAGCCACTGTCCACGATCCAG 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4286 AAGGACGCGGCCGCCATGGACCCCATGGGAGACACCCTGGAGCCACTGTCCACGATCCAG 4345

Qy 3901 TACAACGTGGTGGATGGCCTGAGGGACCGCCGGAGCTTCCACGGGCCGTACACAGTGCAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4346 TACAACGTGGTGGATGGCCTGAGGGACCGCCGGAGCTTCCACGGGCCGTACACAGTGCAG 4405

Qy 3961 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTGCGTGGGCGCGGGAGCCTCAGC 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4406 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTGCGTGGGCGCGGGAGCCTCAGC 4465

Qy 4021 TGCTTCGGACCCAACCACACGCTGTACCCCATGGTCACGCGGTGGAGGCGGAACGAGGAT 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4466 TGCTTCGGACCCAACCACACGCTGTACCCCATGGTCACGCGGTGGAGGCGGAACGAGGAT 4525

Qy 4081 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4526 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4585

Qy 4141 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4586 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4645

Qy 4201 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4646 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4705

Qy 4261 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4706 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4765

Qy 4321 GAGACGGTGGCCGTCAGCGTCCACTTCCAGGACCAGAATGACGTGGAGCTGAACAGGCTG 4380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4766 GAGACGGTGGCCGTCAGCGTCCACTTCCAGGACCAGAATGACGTGGAGCTGAACAGGCTG 4825

Qy 4381 AACTCTAACCTGCACGCCTGCGACTCGGGGGCCTCCATCCGATGGCAGGTGGTGGACAGG 4440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4826 AACTCTAACCTGCACGCCTGCGACTCGGGGGCCTCCATCCGATGGCAGGTGGTGGACAGG 4885

Qy 4441 CGCATCCCACTCTATGCGAACCACAAGACCCTCCTCCAGAAGGCAGCCGCTGAGTTCGGG 4500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4886 CGCATCCCACTCTATGCGAACCACAAGACCCTCCTCCAGAAGGCAGCCGCTGAGTTCGGG 4945

Qy 4501 GCTCACTACTGA 4512
        ||||||||||||
Db 4946 GCTCACTACTGA 4957

RESULT 6
AB001535
LOCUS       AB001535     6220 bp    mRNA    linear   PRI 28-NOV-1998
DEFINITION  Homo sapiens mRNA, complete cds.
ACCESSION   AB001535
VERSION     AB001535.1  GI:3928755
KEYWORDS    KNP3.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo
REFERENCE   1  (sites)
  AUTHORS   Nagamine,K., Kudoh,J., Minoshima,S., Kawasaki,K., Asakawa,S.,
            Ito,F. and Shimizu,N.
  TITLE     Molecular cloning of a novel putative Ca2+ channel protein (TRPC7)
            highly expressed in brain
  JOURNAL   Genomics 54 (1), 124-131 (1998)
  MEDLINE   99026133
   PUBMED   9806837
REFERENCE   2  (bases 1 to 6220)
  AUTHORS   Shimizu,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-1997) Nobuyoshi Shimizu, Keio University School
            of Medicine, Department of Molecular Biology; 35 Shinanomachi
            Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp,
            Tel:03-3351-2370, Fax:03-3351-2370)
FEATURES             Location/Qualifiers
     source          1..6220
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
                     /tissue_type="brain"
                     /dev_stage="20-26weeks fetus"
                     /tissue_lib="human fetal brain 5'-STRECH Plus cDNA library
                     (CLONTECH) and Human Brain, caudate nucleus 5'-STRECH cDNA
                     library (CLONTECH)"
                     /note="7 Caucasians"
     gene            1..6220
                     /gene="KNP3"
     CDS             446..4957
                     /gene="KNP3"
                     /note="similar to C.elegans hypothetical protein
                     CET01H8.1,CEC05C12.3,CEF54D1.5. similar to trp and
                     trp-like proteins"
                     /codon_start=1
                     /protein_id="BAA34700.1"
                     /db_xref="GI:3928756"

                     /translation="MEPSALRKAGSEQEEGFEGLPRRVTDLGMVSNLRRSNSSLFKSW
                     RLQCPFGNNDKQESLSSWIPENIKKKECVYFVESSKLSDAGKVVCQCGYTHEQHLEEA
                     TKPHTFQGTQWDPKKHVQEMPTDAFGDIVFTGLSQKVKKYVRVSQDTPSSVIYHLMTQ
                     HWGLDVPNLLISVTGGAKNFNMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQV
                     GEAVRDFSLSSSYKEGELITIGVATWGTVHRREGLIHPTGSFPAEYILDEDGQGNLTC
                     LDSNHSHFILVDDGTHGQYGVEIPLRTRLEKFISEQTKERGGVAIKIPIVCVVLEGGP
                     GTLHTIDNATTNGTPCVVVEGSGRVADVIAQVANLPVSDITISLIQQKLSVFFQEMFE
                     TFTESRIVEWTKKIQDIVRRRQLLTVFREGKDGQQDVDVAILQALLKASRSQDHFGHE
                     NWDHQLKLAVAWNRVDIARSEIFMDEWQWKPSDLHPTMTAALISNKPEFVKLFLENGV
                     QLKEFVTWDTLLYLYENLDPSCLFHSKLQKVLVEDPERPACAPAAPRLQMHHVAQVLR
                     ELLGDFTQPLYPRPRHNDRLRLLLPVPHVKLNVQGVSLRSLYKRSSGHVTFTMDPIRD
                     LLIWAIVQNRRELAGIIWAQSQDCIAAALACSKILKELSKEEEDTDSSEEMLALAEEY
                     EHRAIGVFTECYRKDEERAQKLLTRVSEAWGKTTCLQLALEAKDMKFVSHGGIQAFLT
                     KVWWGQLSVDNGLWRVTLCMLAFPLLLTGLISFREKRLQDVGTPAARARAFFTAPVVV
                     FHLNILSYFAFLCLFAYVLMVDFQPVPSWCECAIYLWLFSLVCEEMRQLFYDPDECGL
                     MKKAALYFSDFWNKLDVGAILLFVAGLTCRLIPATLYPGRVILSLDFILFCLRLMHIF
                     TISKTLGPKIIIVKRMMKDVFFFLFLLAVWVVSFGVAKQAILIHNERRVDWLFRGAVY
                     HSYLTIFGQIPGYIDGVNFNPEHCSPNGTDPYKPKCPESDATQQRPAFPEWLTVLLLC
                     LYLLFTNILLLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAAPPPFILLS
                     HLQLFIKRVVLKTPAKRHKQLKNKLEKNEEAALLSWEIYLKENYLQNRQFQQKQRPEQ
                     KIEDISNKVDAMVDLLDLDPLKRSGSMEQRLASLEEQVAQTARALHWIVRTLRASGFS
                     SEADVPTLASQKAAEEPDAEPGGRKKTEEPGDSYHVNARHLLYPNCPVTRFPVPNEKV
                     PWETEFLIYDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNVVDGLRDRRSFHGPYTVQ
                     AGLPLNPMGRTGLRGRGSLSCFGPNHTLYPMVTRWRRNEDGAICRKSIKKMLEVLVVK
                     LPLSEHWALPGGSREPGEMLPRKLKRILRQEHWPSFENLLKCGMEVYKGYMDDPRNTD
                     NAWIETVAVSVHFQDQNDVELNRLNSNLHACDSGASIRWQVVDRRIPLYANHKTLLQK
                     AAAEFGAHY"



ORIGIN

  Query Match 100.0%; Score 4512; DB 9; Length 6220;
  Best Local Similarity 100.0%; Pred. No. 0;
  Matches 4512; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  446 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 505

Qy   61 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  506 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 565

Qy  121 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  566 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 625

Qy  181 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  626 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 685

Qy  241 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  686 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 745

Qy  301 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  746 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 805

Qy  361 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  806 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 865

Qy  421 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  866 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 925

Qy  481 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  926 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 985

Qy  541 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  986 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 1045
Qy  601 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1046 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 1105

Qy  661 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1106 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 1165

Qy  721 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1166 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 1225

Qy  781 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1226 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 1285

Qy  841 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1286 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 1345

Qy  901 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1346 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 1405

Qy  961 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1406 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 1465

Qy 1021 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1466 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 1525

Qy 1081 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1526 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 1585

Qy 1141 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1586 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 1645

Qy 1201 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1646 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 1705



Q y 1261 GMGGC^ 1320 

Db 1706 GAAGGCAAGGATGGTCAGCAGGACGTGGATOT 1765 

Qy 1321 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1766 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 1825

Qy 1381 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1826 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 1885

Qy 1441 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1886 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 1945

Qy 1501 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1946 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 2005

Qy 1561 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2006 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 2065

Qy 1621 GTGGAGGATCCCGAGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2066 GTGGAGGATCCCGAGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 2125

Qy 1681 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2126 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 2185

QY 1741 CGGCA^^ 180Q 

Db 218 6 CGGCACAACGACCGGCTGCGGCTCCTGCTGCCCGTTCC^ 2 24 5 

Qy 1801 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2246 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 2305

Qy 1861 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2306 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 2365

Qy 1921 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2366 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 2425

Qy 1981 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2426 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 2485

Qy 2041 GAGTATGAGCACAGAGCCATCGGGGTCTTCACCGAGTGCTACCGGAAGGACGAAGAGAGA 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2486 GAGTATGAGCACAGAGCCATCGGGGTCTTCACCGAGTGCTACCGGAAGGACGAAGAGAGA 2545

Q y 2101 GCCCAG/^CTC 216Q 

Db 254 6 GCCCAGAAACTGCTCACCCGCGTGTCCG^ 2 60 5 

Qy 2161 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2606 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC 2665

Qy 2221 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2666 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 2725

Qy 2281 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2726 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 2785

Qy 2341 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2786 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 2845

Qy 2401 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2846 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 2905

Qy 2461 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2906 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 2965

Qy 2521 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2966 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 3025

Qy 2581 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3026 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 3085

Qy 2641 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3086 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 3145

Qy 2701 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3146 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 3205

Qy 2761 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3206 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 3265

Qy 2821 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3266 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 3325

Qy 2881 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3326 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 3385

Qy 2941 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3386 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 3445

Qy 3001 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3446 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3505

Qy 3061 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3506 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 3565

Qy 3121 CTCAACCTCCTCATCGCCATGTTCAACTACACCTTCCAGCAGGTGCAGGAGCACACGGAC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3566 CTCAACCTCCTCATCGCCATGTTCAACTACACCTTCCAGCAGGTGCAGGAGCACACGGAC 3625

Qy 3181 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3626 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3685

Qy 3241 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3686 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3745

Qy 3301 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3746 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3805



Qy 3361 CTGCTATCCTGGGAGATCTACCTGAAGGAGAACTACCTCCAGAACCGACAGTTCCAGCAA 3420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3806 CTGCTATCCTGGGAGATCTACCTGAAGGAGAACTACCTCCAGAACCGACAGTTCCAGCAA 3865

Qy 3421 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAGGTTGACGCCATGGTGGAC 3480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3866 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAGGTTGACGCCATGGTGGAC 3925

Qy 3481 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAGCAGAGGTTGGCCTCCCTG 3540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3926 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAGCAGAGGTTGGCCTCCCTG 3985

Qy 3541 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGGACGCTGCGGGCC 3600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3986 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGGACGCTGCGGGCC 4045

Qy 3601 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCTCCCAGAAGGCCGCGGAGGAG 3660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4046 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCTCCCAGAAGGCCGCGGAGGAG 4105

Qy 3661 CCGGATGCTGAGCCGGGAGGCAGGAAGAAGACGGAGGAGCCGGGCGACAGCTACCACGTG 3720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4106 CCGGATGCTGAGCCGGGAGGCAGGAAGAAGACGGAGGAGCCGGGCGACAGCTACCACGTG 4165

Qy 3721 AATGCCCGGCACCTCCTCTACCCCAACTGCCCTGTCACGCGCTTCCCCGTGCCCAACGAG 3780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4166 AATGCCCGGCACCTCCTCTACCCCAACTGCCCTGTCACGCGCTTCCCCGTGCCCAACGAG 4225

Qy 3781 AAGGTGCCCTGGGAGACGGAGTTCCTGATCTATGACCCACCCTTTTACACGGCAGAGAGG 3840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4226 AAGGTGCCCTGGGAGACGGAGTTCCTGATCTATGACCCACCCTTTTACACGGCAGAGAGG 4285

Qy 3841 AAGGACGCGGCCGCCATGGACCCCATGGGAGACACCCTGGAGCCACTGTCCACGATCCAG 3900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4286 AAGGACGCGGCCGCCATGGACCCCATGGGAGACACCCTGGAGCCACTGTCCACGATCCAG 4345

Qy 3901 TACAACGTGGTGGATGGCCTGAGGGACCGCCGGAGCTTCCACGGGCCGTACACAGTGCAG 3960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4346 TACAACGTGGTGGATGGCCTGAGGGACCGCCGGAGCTTCCACGGGCCGTACACAGTGCAG 4405

Qy 3961 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTGCGTGGGCGCGGGAGCCTCAGC 4020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4406 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTGCGTGGGCGCGGGAGCCTCAGC 4465

Qy 4021 TGCTTCGGACCCAACCACACGCTGTACCCCATGGTCACGCGGTGGAGGCGGAACGAGGAT 4080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4466 TGCTTCGGACCCAACCACACGCTGTACCCCATGGTCACGCGGTGGAGGCGGAACGAGGAT 4525

Qy 4081 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4526 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4585

Qy 4141 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4586 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4645

Qy 4201 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4646 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4705

Qy 4261 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4706 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4765

Qy 4321 GAGACGGTGGCCGTCAGCGTCCACTTCCAGGACCAGAATGACGTGGAGCTGAACAGGCTG 4380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4766 GAGACGGTGGCCGTCAGCGTCCACTTCCAGGACCAGAATGACGTGGAGCTGAACAGGCTG 4825

Qy 4381 AACTCTAACCTGCACGCCTGCGACTCGGGGGCCTCCATCCGATGGCAGGTGGTGGACAGG 4440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4826 AACTCTAACCTGCACGCCTGCGACTCGGGGGCCTCCATCCGATGGCAGGTGGTGGACAGG 4885

Qy 4441 CGCATCCCACTCTATGCGAACCACAAGACCCTCCTCCAGAAGGCAGCCGCTGAGTTCGGG 4500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4886 CGCATCCCACTCTATGCGAACCACAAGACCCTCCTCCAGAAGGCAGCCGCTGAGTTCGGG 4945

Qy 4501 GCTCACTACTGA 4512
        ||||||||||||
Db 4946 GCTCACTACTGA 4957
RESULT 1
US-09-600-087-1
; Sequence 1, Application US/09600087
; Patent No. 6548272
; GENERAL INFORMATION:
;  APPLICANT: Shimizu, Nobuyoshi
;  APPLICANT: Nagamine, Kentaro
;  TITLE OF INVENTION: GENE CODING FOR A NOVEL TRANSMEMBRANE PROTEIN
;  FILE REFERENCE: 11283-004001
;  CURRENT APPLICATION NUMBER: US/09/600,087
;  CURRENT FILING DATE: 2000-07-11
;  PRIOR APPLICATION NUMBER: PCT/JP99/06289
;  PRIOR FILING DATE: 1999-11-11
;  PRIOR APPLICATION NUMBER: JP/321200/1998
;  PRIOR FILING DATE: 1998-11-12
;  NUMBER OF SEQ ID NOS: 2
;  SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 1
;   LENGTH: 6220
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: (446)...(4954)
;   NAME/KEY: polyA_signal
;   LOCATION: (5812)...(5817)
US-09-600-087-1

  Query Match 100.0%; Score 4512; DB 4; Length 6220;
  Best Local Similarity 100.0%; Pred. No. 0;
  Matches 4512; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  446 ATGGAGCCCTCAGCCCTGAGGAAAGCTGGCTCGGAGCAGGAGGAGGGCTTTGAGGGGCTG 505

Qy   61 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  506 CCCAGAAGGGTCACTGACCTGGGGATGGTCTCCAATCTCCGGCGCAGCAACAGCAGCCTC 565

Qy  121 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  566 TTCAAGAGCTGGAGGCTACAGTGCCCCTTCGGCAACAATGACAAGCAAGAAAGCCTCAGT 625

Qy  181 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  626 TCGTGGATTCCTGAAAACATCAAGAAGAAAGAATGCGTGTATTTTGTGGAAAGTTCCAAA 685

Qy  241 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  686 CTGTCTGATGCTGGGAAGGTGGTGTGTCAGTGTGGCTACACGCATGAGCAGCACTTGGAG 745

Qy  301 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  746 GAGGCTACCAAGCCCCACACCTTCCAGGGCACACAGTGGGACCCAAAGAAACATGTCCAG 805

Qy  361 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  806 GAGATGCCAACCGATGCCTTTGGCGACATCGTCTTCACGGGCCTGAGCCAGAAGGTGAAA 865

Qy  421 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  866 AAGTACGTCCGAGTCTCCCAGGACACGCCCTCCAGCGTGATCTACCACCTCATGACCCAG 925

Qy  481 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  926 CACTGGGGGCTGGACGTCCCCAATCTCTTGATCTCGGTGACCGGGGGGGCCAAGAACTTC 985
Qy  541 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  986 AACATGAAGCCGCGGCTGAAGAGCATTTTCCGCAGAGGCCTGGTCAAGGTGGCTCAGACC 1045

Qy  601 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1046 ACAGGGGCCTGGATCATCACAGGGGGGTCCCACACCGGCGTCATGAAGCAGGTAGGCGAG 1105

Qy  661 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1106 GCGGTGCGGGACTTCAGCCTGAGCAGCAGCTACAAGGAAGGCGAGCTCATCACCATCGGA 1165

Qy  721 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1166 GTCGCCACCTGGGGCACTGTCCACCGCCGCGAGGGCCTGATCCATCCCACGGGCAGCTTC 1225

Qy  781 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1226 CCCGCCGAGTACATACTGGATGAGGATGGCCAAGGGAACCTGACCTGCCTAGACAGCAAC 1285

Qy  841 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1286 CACTCTCACTTCATCCTCGTGGACGACGGGACCCACGGCCAGTACGGGGTGGAGATTCCT 1345

Qy  901 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1346 CTGAGGACCAGGCTGGAGAAGTTCATATCGGAGCAGACCAAGGAAAGAGGAGGTGTGGCC 1405

Qy  961 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1406 ATCAAGATCCCCATCGTGTGCGTGGTGCTGGAGGGCGGCCCGGGCACGTTGCACACCATC 1465

Qy 1021 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1466 GACAACGCCACCACCAACGGCACCCCCTGTGTGGTTGTGGAGGGCTCGGGCCGCGTGGCC 1525

Qy 1081 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1526 GACGTCATTGCCCAGGTGGCCAACCTGCCTGTCTCGGACATCACTATCTCCCTGATCCAG 1585



Qy 1141 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1586 CAGAAACTGAGCGTGTTCTTCCAGGAGATGTTTGAGACCTTCACGGAAAGCAGGATTGTC 1645

Qy 1201 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1646 GAGTGGACCAAAAAGATCCAAGATATTGTCCGGAGGCGGCAGCTGCTGACTGTCTTCCGG 1705

Qy 1261 GAAGGCAAGGATGG 1320 

Db 1706 GAAGGCAAGGATGGTCAGCAGGACGTGGATGTGGCCATCTTGCAG^ 176 5 

Qy 1321 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1766 TCACGGAGCCAAGACCACTTTGGCCACGAGAACTGGGACCACCAGCTGAAACTGGCAGTG 1825

Qy 1381 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1826 GCATGGAATCGCGTGGACATTGCCCGCAGTGAGATCTTCATGGATGAGTGGCAGTGGAAG 1885

Qy 1441 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1886 CCTTCAGATCTGCACCCCACGATGACAGCTGCACTCATCTCCAACAAGCCTGAGTTTGTG 1945

Qy 1501 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1946 AAGCTCTTCCTGGAAAACGGGGTGCAGCTGAAGGAGTTTGTCACCTGGGACACCTTGCTC 2005

Qy 1561 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2006 TACCTGTACGAGAACCTGGACCCCTCCTGCCTGTTCCACAGCAAGCTGCAAAAGGTGCTG 2065

Qy 1621 GTGGAGGATCCCGAGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2066 GTGGAGGATCCCGAGCGCCCGGCTTGCGCGCCCGCGGCGCCCCGCCTGCAGATGCACCAC 2125

Qy 1681 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2126 GTGGCCCAGGTGCTGCGGGAGCTGCTGGGGGACTTCACGCAGCCGCTTTATCCCCGGCCC 2185



1741 CGGCACAACGA 1800 
CGGCACAACGACCGGCTGCGGCTCCTGCTGCCCGTTCCCCACGTCAAGCTC 224 5 



Db 2186 



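Qy 1801 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 1860
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2246 GGAGTGAGCCTCCGGTCCCTCTACAAGCGTTCCTCAGGCCATGTGACCTTCACCATGGAC 2305

Qy 1861 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 1920
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2306 CCCATCCGTGACCTTCTCATTTGGGCCATTGTCCAGAACCGTCGGGAGCTGGCAGGAATC 2365

Qy 1921 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 1980
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2366 ATCTGGGCTCAGAGCCAGGACTGCATCGCAGCGGCCTTGGCCTGCAGCAAGATCCTGAAG 2425

Qy 1981 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 2040
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2426 GAACTGTCCAAGGAGGAGGAGGACACGGACAGCTCGGAGGAGATGCTGGCGCTGGCGGAG 2485

Qy 2041 GAGTATGAGCACAGAGCCATCGGGGTCTTCACCGAGTGCTACCGGAAGGACGAAGAGAGA 2100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2486 GAGTATGAGCACAGAGCCATCGGGGTCTTCACCGAGTGCTACCGGAAGGACGAAGAGAGA 2545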

Qy 2 '1°1 GCCCAGAAACTGCTCACCCGCGTGTCCGAGGCCTGGGGGAAGACCACCTGCCTC 216 0 

Db 2 54 6 GCCCAGAAACTGCTCACCCGCGTGTC^ 26 05 

Qy 2161 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC 2220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2606 GCCCTGGAGGCCAAGGACATGAAGTTTGTGTCTCACGGGGGCATCCAGGCCTTCCTGACC 2665

Qy 2221 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 2280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2666 AAGGTGTGGTGGGGCCAGCTCTCCGTGGACAATGGGCTGTGGCGTGTGACCCTGTGCATG 2725

Qy 2281 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 2340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2726 CTGGCCTTCCCGCTGCTCCTCACCGGCCTCATCTCCTTCAGGGAGAAGAGGCTGCAGGAT 2785

Qy 2341 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 2400
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2786 GTGGGCACCCCCGCGGCCCGCGCCCGTGCCTTCTTCACCGCACCCGTGGTGGTCTTCCAC 2845

Qy 2401 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 2460
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2846 CTGAACATCCTCTCCTACTTCGCCTTCCTCTGCCTGTTCGCCTACGTGCTCATGGTGGAC 2905

Qy 2461 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 2520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2906 TTCCAGCCTGTGCCCTCCTGGTGCGAGTGTGCCATCTACCTCTGGCTCTTCTCCTTGGTG 2965


Qy 2521 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 2580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 2966 TGCGAGGAGATGCGGCAGCTCTTCTATGACCCTGACGAGTGCGGGCTGATGAAGAAGGCA 3025


Qy 2581 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 2640
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3026 GCCTTGTACTTCAGTGACTTCTGGAATAAGCTGGACGTCGGCGCAATCTTGCTCTTCGTG 3085


Qy 2641 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 2700
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3086 GCAGGGCTGACCTGCAGGCTCATCCCGGCGACGCTGTACCCCGGGCGCGTCATCCTCTCT 3145


Qy 2701 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 2760
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3146 CTGGACTTCATCCTGTTCTGCCTCCGGCTCATGCACATTTTTACCATCAGTAAGACGCTG 3205


Qy 2761 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 2820
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3206 GGGCCCAAGATCATCATTGTGAAGCGGATGATGAAGGACGTCTTCTTCTTCCTCTTCCTG 3265


Qy 2821 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 2880
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3266 CTGGCTGTGTGGGTGGTGTCCTTCGGGGTGGCCAAGCAGGCCATCCTCATCCACAACGAG 3325


Qy 2881 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 2940
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3326 CGCCGGGTGGACTGGCTGTTCCGAGGGGCCGTCTACCACTCCTACCTCACCATCTTCGGG 3385


Qy 2941 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 3000
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3386 CAGATCCCGGGCTACATCGACGGTGTGAACTTCAACCCGGAGCACTGCAGCCCCAATGGC 3445


Qy 3001 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3060
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3446 ACCGACCCCTACAAGCCTAAGTGCCCCGAGAGCGACGCGACGCAGCAGAGGCCGGCCTTC 3505


Qy 3061 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 3120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3506 CCTGAGTGGCTGACGGTCCTCCTACTCTGCCTCTACCTGCTCTTCACCAACATCCTGCTG 3565


Qy 3121 CTCAACCTCCTCATCGCCATGTTCAACTACACCTTCCAGCAGGTGCAGGAGCACACGGAC 3180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3566 CTCAACCTCCTCATCGCCATGTTCAACTACACCTTCCAGCAGGTGCAGGAGCACACGGAC 3625


Qy 3181 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 3626 CAGATTTGGAAGTTCCAGCGCCATGACCTGATCGAGGAGTACCACGGCCGCCCCGCCGCG 3685


Qy 3241 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3300

Db 3686 CCGCCCCCCTTCATCCTCCTCAGCCACCTGCAGCTCTTCATCAAGAGGGTGGTCCTGAAG 3745

Qy 3301 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3360

Db 3746 ACTCCGGCCAAGAGGCACAAGCAGCTCAAGAACAAGCTGGAGAAGAACGAGGAGGCGGCC 3805

Qy 3361 CTGCTATCCTGGGAGATCTACCTGAAGGAGAACTACCTCCAGAACCGACAGTTCCAGCAA 3420

Db 3806 CTGCTATCCTGGGAGATCTACCT 3865

Qy 3421 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAGGTTGACGCCATGGTGGAC 3480

Db 3866 AAGCAGCGGCCCGAGCAGAAGATCGAGGACATCAGCAATAAG^ 3925

Qy 3481 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAGCAGAGGTTGGCCTCCCTG 3540

Db 3926 CTGCTGGACCTGGACCCACTGAAGAGGTCGGGCTCCATGGAG^ 3985

Qy 3541 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGGACGCTGCGGGCC 3600

Db 3986 GAGGAGCAGGTGGCCCAGACAGCCCGAGCCCTGCACTGGATCGTGAGG^ 4045

Qy 3601 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCTCCCAGAAGGCCGCGGAGGAG 3660

Db 4046 AGCGGCTTCAGCTCGGAGGCGGACGTCCCCACTCTGGCCT^ 4105

Qy 3661 CCGGATGCTGAGC^ 3720 

Db 4106 CCGGATGCTGAGCCGGGAGGCAGGAAGAAGAC^ 4165 

Qy 3721 AATGCCCGGCACCTCCTCTACCCCAACTGCCCTGTCACGCGCTTCCCCGTGCCCAACGAG 3780

Db 4166 AATGCCCGGCACCTCCTCTACCCC^ 4225

Qy 3781 AAGGTG^ 3840

Db 4226 AAGGTGCCCTGGGAGACGGAGTTCCTGATCTATGACCCACCCTTTTACAC^ 4285

Qy 3841 AAGGACGCGG^ 3900

Db 4286 AAGGACGCGGCCGCCATGGACCCCATGGGAGAC^ 4345

Qy 3901 TACAACGTGGTGGATGGCCTGAGGGACCGCCGGAGCTTCCACGGGCCGTACACAGTGCAG 3960

Db 4346 TACAACGTGGTGGATGGCCTGAGGGACCGC^ 4405

Qy 3961 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTC 4020

Db 4406 GCCGGGTTGCCCCTGAACCCCATGGGCCGCACAGGACTC 4465

Qy 4021 TGCTTCGGACCCAACCACACGCTGTACCCCATGGT 4080

Db 4466 TGCTTCGGACCCAACCACArcCTGT 4525

Qy 4081 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4526 GGAGCCATCTGCAGGAAGAGCATAAAGAAGATGCTGGAAGTGCTGGTGGTGAAGCTCCCT 4585

Qy 4141 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4586 CTCTCCGAGCACTGGGCCCTGCCTGGGGGCTCCCGGGAGCCAGGGGAGATGCTACCTCGG 4645

Qy 4201 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4646 AAGCTGAAGCGGATCCTCCGGCAGGAGCACTGGCCGTCTTTTGAAAACTTGCTGAAGTGC 4705

Qy 4261 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 4706 GGCATGGAGGTGTACAAAGGCTACATGGATGACCCGAGGAACACGGACAATGCCTGGATC 4765



Qy 4321 GAGACGGTGGCCGTCAGCGTCCACTTCCAGGACCAGAATGACGTGGAGCTGAACAGGCTG 4380

Db 4766 GAGACGGTGGCCGTCAGCGTCCACTTCCAGG^ 4825

Qy 4381 AACTCTAACCTGCACGC^ 4440

Db 4826 AACTCTAACCTGCACGCCTGCGACTCGGGGGCCTCCATrc 4885

Qy 4441 CGCATCCCACTCTAT^ 4500

Db 4886 CGCATCCCACTCTATGCGAACCACAAGA^ 4945

Qy 4501 GCTCACTACTGA 4512

Db 4946 GCTCACTACTGA 4957
Hiromichi; Nozawa, Katsura; Mochizuki, Shinobu 
PCT Int. Appl., 98 pp. 
Screening for modulators of human Ca2+-ATP regulated 
2002 : 368512 CAPLUS 
Protein and cDNA sequences of a novel human long 
PCT Int. Appl., 60 pp. 